Search results for "html::treebuilder::libxml"
HTML::TreeBuilder::LibXML - HTML::TreeBuilder and XPath compatible interface with libxml
HTML::TreeBuilder::XPath is libxml based compatible interface to HTML::TreeBuilder, which could be slow for a large document. HTML::TreeBuilder::LibXML is a drop-in replacement for HTML::TreeBuilder::XPath. This module doesn't implement all of HTML::Tr...
TOKUHIROM/HTML-TreeBuilder-LibXML-0.26 - 19 Oct 2016 15:08:57 UTC - Search in distribution
- HTML::TreeBuilder::LibXML::Node - HTML::Element compatible API for HTML::TreeBuilder::LibXML
WWW::GoKGS::LibXML - HTML::TreeBuilder::LibXML-based WWW::GoKGS
This class inherits all methods from WWW::GoKGS. Unlike "WWW::GoKGS", this class uses HTML::TreeBuilder::LibXML instead of HTML::TreeBuilder::XPath to parse HTML documents. Make sure to install the alternative module in addition to this module....
ANAZAWA/WWW-GoKGS-0.21 - 21 Aug 2014 02:27:48 UTC - Search in distribution
- WWW::GoKGS::Scraper - Abstract base class for KGS scrapers
perlfaq6 - Regular Expressions
This section is surprisingly small because the rest of the FAQ is littered with answers involving regular expressions. For example, decoding a URL and checking whether something is a number can be handled with regular expressions, but those answers a...
ETHER/perlfaq-5.20240218 - 18 Feb 2024 17:37:36 UTC - Search in distribution
XML::Twig - A perl module for processing huge XML documents in tree mode.
This module provides a way to process XML documents. It is built on top of "XML::Parser". The module offers a tree interface to the document, while allowing you to output the parts of it that have been completely processed. It allows minimal resource...
MIROD/XML-Twig-3.52 - 23 Nov 2016 17:21:16 UTC - Search in distribution
WebSource::Parser - A XML/HTML parser extending XML::LibXML
A simple XML::LibXML extension to be more robust in parsing HTML by using HTML::TreeBuilder...
HABEGGER/WebSource-2.4.5 - 09 Jun 2010 10:22:18 UTC - Search in distribution
Text::Distill - Quick texts compare, plagiarism and common parts detection
GRIBUSER/Text-Distill-0.5 - 09 Oct 2020 11:13:44 UTC - Search in distribution
Task::Kensho - A Glimpse at an Enlightened Perl
Task::Kensho is a list of recommended, widely used and best-in-class modules for Enlightened Perl development. CPAN is wonderful, but there are too many wheels and you have to pick and choose amongst the various competing technologies. From <http://e...
ETHER/Task-Kensho-0.41 - 03 Jul 2021 03:40:21 UTC - Search in distribution
Web::Query - Yet another scraping library like jQuery
Web::Query is yet another scraping framework, with a jQuery-like interface. Yes, I know Ingy's pQuery. But it's just alpha quality. It doesn't work. Web::Query is built on top of the CPAN modules HTML::TreeBuilder::XPath, LWP::UserAgent, and HTML::Se...
YANICK/Web-Query-1.01 - 16 Jan 2024 20:28:14 UTC - Search in distribution
- Web::Query::LibXML - fast, drop-in replacement for Web::Query
DOM::Tiny - This is an empty subclass, you wanted Mojo::DOM58
MSTROUT/DOM-Tiny-0.005 - 01 Aug 2016 16:12:04 UTC - Search in distribution
Bundle::SYP - SYP's cozy environment
SYP/Bundle-SYP-1.5 - 24 Jun 2014 07:39:33 UTC - Search in distribution
Task::BeLike::LESPEA - Modules that LESPEA uses on a daily basis
LESPEA/Task-BeLike-LESPEA-2.005000 - 12 Mar 2014 14:47:57 UTC - Search in distribution
Task::Sympa - Sympa dependencies
Installing this module will install all the modules needed for running the Sympa mailing-list manager, i.e.: * Archive::Zip * CGI * DB_File * DBI * Digest::MD5 * Encode * File::Copy::Recursive * HTML::FormatText * HTML::StripScripts::Parser * HTML::TreeBui...
MARCC/Task-Sympa-1.03 - 29 May 2018 08:04:58 UTC - Search in distribution
Mojo::DOM58 - Minimalistic HTML/XML DOM parser with CSS selectors
Mojo::DOM58 is a minimalistic and relaxed pure-perl HTML/XML DOM parser based on Mojo::DOM. It supports the HTML Living Standard <https://html.spec.whatwg.org/> and Extensible Markup Language (XML) 1.0 <https://www.w3.org/TR/xml/>, and matching based...
DBOOK/Mojo-DOM58-3.001 - 16 Jun 2021 05:30:08 UTC - Search in distribution
XML::Lenient - extracts strings from HTML, XML and similarly tagged text.
XML::Lenient is meant to parse markup languages such as HTML and XML in the knowledge that someone, somewhere, is going to break every rule in the book. It will handle malformed XML, wrongly nested HTML tags and everything else that I have throw...
DAVIES/XML-Lenient-1.0.1 - 15 Nov 2016 13:27:29 UTC - Search in distribution
Web::Scraper - Web Scraping Toolkit using HTML and CSS Selectors or XPath expressions
Web::Scraper is a web scraper toolkit, inspired by Ruby's equivalent Scrapi. It provides a DSL-ish interface for traversing HTML documents and returning a neatly arranged Perl data structure. The *scraper* and *process* blocks provide a method to def...
MIYAGAWA/Web-Scraper-0.38 - 20 Oct 2014 00:27:05 UTC - Search in distribution
- Web::Scraper::LibXML - Drop-in replacement for Web::Scraper to use LibXML
Task::BeLike::TOKUHIROM - modules I use
This Task installs modules that I need to work with. They are listed in this distribution's cpanfile....
TOKUHIROM/Task-BeLike-TOKUHIROM-0.02 - 20 Mar 2014 01:35:41 UTC - Search in distribution
Module::Format::ModuleList - an ordered list of Module::Format::Module.
SHLOMIF/Module-Format-0.4.0 - 06 Apr 2020 14:21:10 UTC - Search in distribution
Tree::Transform::XSLTish - transform tree data, like XSLT but in Perl
This module allows you to transform trees with Perl subroutines, just like XSLT does for XML documents. It tries to model the semantics of XSLT as closely as is reasonable....
DAKKAR/Tree-Transform-XSLTish-0.3 - 13 Feb 2011 14:45:10 UTC - Search in distribution
Bundle::DadaMailXXL - CPAN Bundle of ALL CPAN modules used in Dada Mail
"Bundle::DadaMailXXL" is a CPAN Bundle of ALL CPAN modules used by Dada Mail. "Bundle::DadaMailXXL" will pull modules listed in "Bundle::DadaMail::IncludedInDistribution" (CPAN modules usually bundled within the distro) and "Bundle::DadaMail" (module...
JJSIMONI/Bundle-DadaMailXXL-0.0.9 - 06 Jul 2023 22:55:41 UTC - Search in distribution
WWW::MobileCarrierJP - scrape mobile carrier information
Japanese mobile phone carriers don't provide any information in a machine-readable format :( This is a good wrapper for this problem. This module produces a machine-readable format from HTML :)...
TOKUHIROM/WWW-MobileCarrierJP-0.65 - 30 Aug 2016 10:40:08 UTC - Search in distribution